AITopics | consistent estimation

Entropy Rate Estimation for Markov Chains with Large State Space

Neural Information Processing SystemsMar-16-2026, 23:27:50 GMT

Entropy estimation is one of the prototypical problems in distribution property testing. To consistently estimate the Shannon entropy of a distribution on $S$ elements with independent samples, the optimal sample complexity scales sublinearly with $S$ as $\Theta(\frac{S}{\log S})$ as shown by Valiant and Valiant \cite{Valiant--Valiant2011}. Extending the theory and algorithms for entropy estimation to dependent data, this paper considers the problem of estimating the entropy rate of a stationary reversible Markov chain with $S$ states from a sample path of $n$ observations.

artificial intelligence, machine learning, proceedings, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

866d90e0921ac7b024b47d672445a086-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 06:13:26 GMT

dataset, mixture component, mixture model, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > Canada (0.04)
Europe > Germany > Berlin (0.04)
Asia (0.04)

Industry: Government > Regional Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)
(2 more...)

Add feedback

On the consistent estimation of optimal Receiver Operating Characteristic (ROC) curve

Neural Information Processing SystemsDec-25-2025, 05:51:36 GMT

Under a standard binary classification setting with possible model misspecification, we study the problem of estimating general Receiver Operating Characteristic (ROC) curve, which is an arbitrary set of false positive rate (FPR) and true positive rate (TPR) pairs. We formally introduce the notion of \textit{optimal ROC curve} over a general model space. It is argued that any ROC curve estimation methods implemented over the given model space should target the optimal ROC curve over that space. Three popular ROC curve estimation methods are then analyzed at the population level (i.e., when there are infinite number of samples) under both correct and incorrect model specification. Based on our analysis, they are all consistent when the surrogate loss function satisfies certain conditions and the given model space includes all measurable classifiers. Interestingly, some of these conditions are similar to those that are required to ensure classification consistency. When the model space is incorrectly specified, however, we show that only one method leads to consistent estimation of the ROC curve over the chosen model space. We present some numerical results to demonstrate the effects of model misspecification on the performance of various methods in terms of their ROC curve estimates.

consistent estimation, optimal receiver operating characteristic, receiver operating characteristic, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Neural Information Processing SystemsDec-24-2025, 23:41:22 GMT

Previous works could obtain non-trivial guarantees only under the assumptions that the measurement noise corresponding to the inliers is polynomially small in $n$ (e.g., Gaussian with variance $1/n^2$).To devise our estimators, we equip the Huber loss with non-smooth regularizers such as the $\ell_1$ norm or the nuclear norm, and extend d'Orsi et al.'s approach~\cite{ICML-linear-regression} in a novel way to analyze the loss function.Our machinery appears to be easily applicable to a wide range of estimation problems.We complement these algorithmic results with statistical lower bounds showing that the fraction of inliers that our PCA estimator can deal with is optimal up to a constant factor.

consistent estimation, name change, pca and sparse regression, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.42)

Add feedback

Consistent Estimation of Identifiable Nonparametric Mixture Models from Grouped Observations

Neural Information Processing SystemsDec-24-2025, 06:26:33 GMT

Recent research has established sufficient conditions for finite mixture models to be identifiable from grouped observations. These conditions allow the mixture components to be nonparametric and have substantial (or even total) overlap. This work proposes an algorithm that consistently estimates any identifiable mixture model from grouped observations. Our analysis leverages an oracle inequality for weighted kernel density estimators of the distribution on groups, together with a general result showing that consistent estimation of the distribution on groups implies consistent estimation of mixture components. A practical implementation is provided for paired observations, and the approach is shown to outperform existing methods, especially when mixture components overlap significantly.

consistent estimation, identifiable nonparametric mixture model, name change, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.42)

Add feedback

Consistent Estimation of Functions of Data Missing Non-Monotonically and Not at Random

Neural Information Processing SystemsNov-21-2025, 14:53:22 GMT

Missing records are a perennial problem in analysis of complex data of all types, when the target of inference is some function of the full data law. In simple cases, where data is missing at random or completely at random (Rubin, 1976), well-known adjustments exist that result in consistent estimators of target quantities. Assumptions underlying these estimators are generally not realistic in practical missing data problems. Unfortunately, consistent estimators in more complex cases where data is missing not at random, and where no ordering on variables induces monotonicity of missingness status are not known in general, with some notable exceptions (Robins, 1997), (Tchetgen Tchetgen et al, 2016), (Sadinle and Reiter, 2016). In this paper, we propose a general class of consistent estimators for cases where data is missing not at random, and missingness status is non-monotonic. Our estimators, which are generalized inverse probability weighting estimators, make no assumptions on the underlying full data law, but instead place independence restrictions, and certain other fairly mild assumptions, on the distribution of missingness status conditional on the data. The assumptions we place on the distribution of missingness status conditional on the data can be viewed as a version of a conditional Markov random field (MRF) corresponding to a chain graph. Assumptions embedded in our model permit identification from the observed data law, and admit a natural fitting procedure based on the pseudo likelihood approach of (Besag, 1975). We illustrate our approach with a simple simulation study, and an analysis of risk of premature birth in women in Botswana exposed to highly active anti-retroviral therapy.

consistent estimation, data missing non-monotonically, estimator, (8 more...)

Neural Information Processing Systems

Country: Africa > Botswana (0.27)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Add feedback

Entropy Rate Estimation for Markov Chains with Large State Space

Neural Information Processing SystemsNov-20-2025, 22:42:44 GMT

Entropy estimation is one of the prototypical problems in distribution property testing. To consistently estimate the Shannon entropy of a distribution on $S$ elements with independent samples, the optimal sample complexity scales sublinearly with $S$ as $\Theta(\frac{S}{\log S})$ as shown by Valiant and Valiant \cite{Valiant--Valiant2011}. Extending the theory and algorithms for entropy estimation to dependent data, this paper considers the problem of estimating the entropy rate of a stationary reversible Markov chain with $S$ states from a sample path of $n$ observations.

entropy rate estimation, markov chain, name change, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

On the consistent estimation of optimal Receiver Operating Characteristic (ROC) curve

Neural Information Processing SystemsAug-18-2025, 19:48:56 GMT

We formally introduce the notion of optimal ROC curve over a general model space.

artificial intelligence, machine learning, roc curve, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Ohio > Franklin County > Columbus (0.04)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Consistent Estimation of Identifiable Nonparametric Mixture Models from Grouped Observations Alexander Ritchie

Neural Information Processing SystemsAug-14-2025, 23:54:25 GMT

Most work on estimating mixture models assumes an iid sampling scheme.

dataset, mixture component, mixture model, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > Canada (0.04)
Europe > Germany > Berlin (0.04)
Asia (0.04)

Industry: Government > Regional Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)
(2 more...)

Add feedback

Consistent Estimation of Identifiable Nonparametric Mixture Models from Grouped Observations

Neural Information Processing SystemsMay-27-2025, 04:58:05 GMT

Recent research has established sufficient conditions for finite mixture models to be identifiable from grouped observations. These conditions allow the mixture components to be nonparametric and have substantial (or even total) overlap. This work proposes an algorithm that consistently estimates any identifiable mixture model from grouped observations. Our analysis leverages an oracle inequality for weighted kernel density estimators of the distribution on groups, together with a general result showing that consistent estimation of the distribution on groups implies consistent estimation of mixture components. A practical implementation is provided for paired observations, and the approach is shown to outperform existing methods, especially when mixture components overlap significantly.

artificial intelligence, consistent estimation, identifiable nonparametric mixture model, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.49)

Add feedback

Filters

Collaborating Authors

consistent estimation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Entropy Rate Estimation for Markov Chains with Large State Space

866d90e0921ac7b024b47d672445a086-Paper.pdf

On the consistent estimation of optimal Receiver Operating Characteristic (ROC) curve

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Consistent Estimation of Identifiable Nonparametric Mixture Models from Grouped Observations

Consistent Estimation of Functions of Data Missing Non-Monotonically and Not at Random

Entropy Rate Estimation for Markov Chains with Large State Space

On the consistent estimation of optimal Receiver Operating Characteristic (ROC) curve

Consistent Estimation of Identifiable Nonparametric Mixture Models from Grouped Observations Alexander Ritchie

Consistent Estimation of Identifiable Nonparametric Mixture Models from Grouped Observations